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The Orbiting Carbon Observatory-2 (OCO-2), scheduled to launch in July 2014, is a NASA mission designed to 
measure atmospheric C0 2 . Its main purpose is to allow inversions of net flux estimates of C0 2 on regional to con- 
tinental scales using the total column C0 2 retrieved using high-resolution spectra in the 0.76, 1.6, and 2.0 pm 
ranges. Recently, it was shown that solar-induced chlorophyll fluorescence (SIF), a proxy for gross primary pro- 
duction (GPP, carbon uptake through photosynthesis), can be accurately retrieved from space using high spectral 
resolution radiances in the 750 nm range from the Japanese GOSAT and European GOME-2 instruments. Here, 
we use real OCO-2 thermal vacuum test data as well as a full repeat cycle (16 days) of simulated OCO-2 spectra 
under realistic conditions to evaluate the potential of OCO-2 for retrievals of chlorophyll fluorescence and also its 
dependence on clouds and aerosols. We find that the single-measurement precision is 0.3-0.5 Wm~ 2 sr~ 1 pm~ 1 
(15-25% of typical peak values), better than current measurements from space but still difficult to interpret on a 
single-sounding basis. The most significant advancement will come from smaller ground-pixel sizes and in- 
creased measurement frequency, with a 100-fold increase compared to GOSAT (and about 8 times higher than 
GOME-2). This will largely decrease the need for coarse spatial and temporal averaging in data analysis and 
pave the way to accurate local studies. We also find that the lack of full global mapping from the OCO-2 only in- 
curs small representativeness errors on regional averages. Eventually, the combination of net ecosystem ex- 
change (NEE) derived from C0 2 source/sink inversions and SIF as proxy for GPP from the same satellite will 
provide a more process-based understanding of the global carbon cycle. 

© 2014 Elsevier Inc. All rights reseved. 


1. Introduction 

Photosynthetic uptake of carbon dioxide by terrestrial vegetation is 
driven by absorbed photosynthetically active radiation (APAR) in the 
400-700 nm wavelength region. The efficiency with which photons 
are used for actual photosynthesis varies because absorbed energy can 
also be dissipated into heat (non-photochemical quenching, NPQ). A 
small fraction is also re-radiated at longer wavelengths (660-800 nm), 
which is termed fluorescence and has been widely used in photosynthe- 
sis research for decades (e.g. Baker, 2008; Campbell, Middleton, Corp, 8; 
Kim, 2008; Corp, Middleton, McMurtrey, Entcheva Campbell, & Butcher, 
2006; Genty, Briantais, & Baker, 1 989; Krause & Weis, 1 984; Moya et al., 
2004). Active pulse-modulated fluorometry allows a direct quantifica- 
tion of photosynthetic efficiency by measuring fluorescence at steady 
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state as well as upon illumination with a saturating light pulse. Even 
though this technique cannot be directly applied from space, steady- 
state sun-induced chlorophyll fluorescence (SIF) also offers the poten- 
tial to place constraints on gross primary production (GPP) and thereby 
gain a more mechanistic understanding of ecosystem carbon exchange 
(Frankenberg, Fisher et al„ 201 1 ). 

Recently, Joiner et al. (201 1 ), Frankenberg, Butz, and Toon (201 1 ), 
Frankenberg, Fisher, et al. (2011), and Guanter et al. (2012) showed 
that SIF can be retrieved based on in-filling of solar absorption features, 
so called Fraunhofer lines. The first real retrievals were performed using 
high-resolution spectra of the oxygen A-band region from the Tanso 
Fourier Transform Spectrometer onboard of the Japanese GOSAT satel- 
lite (Hamazaki, Kaneko, Kuze, & Kondo, 2005; Kuze, Suto, Nakajima, & 
Flamazaki, 2009). Later, Joiner et al. (2012) showed that even moderate 
spectral resolution can be used to infer SIF using either a very broad 
Fraunhofer line at 866 nm (using SCIAMACHY) or applying a larger 
spectral range from 715 to 780 nm using GOME-2 (Joiner et al., 2013). 
In general, there is a trade-off between spectral resolution and spatio- 


http://dx.doi.Org/10.1016/j.rse.2014.02.007 
0034-4257/© 2014 Elsevier Inc. All rights reseved. 




2 


C. Frankenberg et al / Remote Sensing of Environment 147 (2014) 1-12 


temporal sampling. On the one hand, high spectral resolution enables 
more robust and accurate S1F retrievals but on the other hand it sacri- 
fices spatial sampling. GOSAT retrievals (Frankenberg, Butz, et al„ 
2011; Frankenberg, Fisher, et al„ 2011; Guanter et al„ 2012; Joiner 
et al„ 2011, 2012), for instance, are well characterized but cannot pro- 
vide the spatial mapping of GOME-2. Neither will OCO-2 provide full 
spatial mapping even though it records about 8 times more spectra 
than GOME-2. Despite these limitations, GOSAT SIF retrievals have al- 
ready been proven very useful in the analysis of carbon exchange in 
the tropical Amazon (Lee et aL, 2013; Parazoo et al„ 2013). 

The Orbiting Carbon Observatory-2 (OCO-2), to be launched in July 
2014, can alleviate some of the drawbacks of current SIF observations 
and combine the advantages of both GOSAT and GOME-2. It will enable 
SIF retrievals similar to GOSAT but will acquire 24 spectra per second in- 
stead of one every 4 s. This almost 100-fold increase in data density 
compared to GOSAT is combined with much smaller ground-pixels 
(1.3 x 2.25 km 2 instead of a circular footprint with 10 km diameter for 
GOSAT or 80 x 40 km 2 for GOME-2). As such, it has the potential to 
largely improve current SIF retrievals as well as bringing together scien- 
tific communities focused on top-down net ecosystem exchange inver- 
sions based on atmospheric C0 2 measurements with those using 
bottom-up estimates derived from remotely sensed vegetation param- 
eters. Here, we will quantify the potential of OCO-2 using both retrievals 
based on a synthetic full repeat cycle of OCO-2 data as well as real ther- 
mal vacuum test data. The main purpose of this publication is to make 
potential users aware of datasets for vegetation-specific applications 
that can be expected from the OCO-2. The primary mission objective 
of OCO-2 is to measure atmospheric C0 2 with the precision and accura- 
cy needed for source/sink inversions on regional scales. Thus, the 
current user community is composed of atmospheric inversion experts 
and global carbon cycle modelers. The new fluorescence product might 
be of interest to different scientific communities and this paper tries to 
bridge this gap as most previous publications on the OCO-2 focused on 
C0 2 only and scientists might be unaware of the potential of OCO-2 
fluorescence retrievals and their applications in photosynthesis re- 
search and carbon cycle science. The use of atmospheric C0 2 in conjunc- 
tion with fluorescence has just started (Parazoo et al., 2013) and we 
anticipate more synergistic applications in the future. 

2. The OCO-2 satellite 

The OCO-2 instrument is a 3-channel grating spectrometer sched- 
uled to launch into a sun-synchronous orbit (as part of the A-train) in 
mid 2014. At 3 Hz readout rate, it will record high resolution spectra 
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Fig. 2. OCO-2 radiometric performance model in the 0 2 -A band. Noise equivalent spectral 
radiance (NESR) as well as signal-to-noise-ratios (SNR) are shown in black and red, re- 
spectively. A Fourier Transform Spectrometer would have a flat spectral response in 
NESR for a given spectrum (i.e. the NESR at continuum level and in the core of absorption 
features is identical but a grating spectrometer has lower NESR in the cores compared to 
the continuum radiance). 


of the O 2 A-band (0.757-0.775 pm, FWHM = 0.042 nm), a weak C0 2 
band (1.594-1.627 pm, FWHM = 0.076 nm), and a strong C0 2 band 
(2.043-2.087 pm, FWHM = 0.097 nm) with 8 independent along-slit 
focal plane array readouts, from here on denoted as footprints. The 
nominal spatial resolution per footprint is 1.3 x 2.25 km 2 with all 8 
individual footprints covering a 10.3 km full swath width. Fig. 1 shows 
a real 0 2 -A band example spectrum of the OCO-2 instrument. Similar 
to the GOSAT, Fraunhofer lines at 758.8 and 770.1 nm can be used for 
the fluorescence retrieval. 

The current radiometric performance of OCO-2, based on exten- 
sive on-ground calibration, is shown in Fig. 2. The noise-equivalent- 
spectral-radiance at zero signal level will correspond to intensities of 
slightly lower than 0.05 Wm -2 sr - 1 pm -1 . In the ideal case of an entire- 
ly opaque Fraunhofer line, we could thus measure SIF with a precision of 
better than 0.05 Wm _2 sr _1 pm -1 with just a single detector pixel. 

In Figs. 1 -2, we have displayed full intensities under the assumption of 
an unpolarized radiance. However, the OCO-2 is only sensitive to intensity 
with polarization parallel to the slit (Stoke vector components: (I-Q)/2). 
To maximize signal (esp. in glint mode) and minimize polarization errors, 
the OCO-2 thus orients its slit perpendicular to the principal plane defined 
by the sun, the surface footprint, and the instrument aperture. 
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Fig. 1. Exemplary OCO-2 spectrum to be used for chlorophyll fluorescence retrievals. Data 
were taken during a thermal vacuum test at JPL on April 20, 2012 looking into the sun 
using a diffuser plate. The primary SIF retrieval fit windows are indicated by transparent 
green rectangles. 


Fig. 3. Typical OCO-2 swath width (lines) and sin(0) (dashed), where 0 is the relative 
angle between the flight track and slit orientation. Near the equator (depending on sea- 
son), the slit will be aligned along-track, effectively reducing the effective swath width 
to the instantaneous field-of-view of a single detector pixel. In glint mode, swaths widths 
and ground-pixel sizes can be somewhat enlarged owing to the slant viewing geometry. 
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0C02 nadir repeat cycle (16 days) 
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Fig. 4. Typical simulated fluorescence retrieval using a 1 6-day OCO-2 Nadir repeat cycle. The narrow stripes represent a 1 .3 to 1 0.5 km swath width. Realistic retrieval noise and data yield 
has been used based on standard cloud-filtering and current OCO-2 signal-to-noise estimates. 


This behavior within a typical orbit is depicted in Fig. 3. Slit rotation 
reduces the effective swath widths, especially near the equator where 
the slit is aligned along track and OCO-2 footprints basically repeat the 
same ground-pixel (in which case you will have a very narrow - 
down to 1 .3 km in the extreme case - swath but largely increased sam- 
pling within the swath as consecutive footprints will sample the same 
location during the overflight). 

These specific sampling aspects only matter for our specific objectives 
here and are not that important for atmospheric retrievals where mostly 
regional-scale spatial gradients in C0 2 are targeted. Typical OCO-2 repeat 
cycles (16-days) will alternate between Nadir (see Fig. 4) and Glint 


viewing mode and a special target observation mode will be used over 
selected ground-based validation sites, staring at the site during the 
overpass and recording thousands of spectra at varying viewing zenith 
angles. Nadir repeat cycles will always follow exactly the same ground 
track for each repeat cycle (providing a good dataset for change detec- 
tion) while the ground tracks in Glint will have somewhat different 
east-west offsets for each repeat cycle, adding some extended spatial 
coverage. The changing viewing geometries will have to be taken into ac- 
count when interpreting S1F data. Nadir spectra of repeating orbits will 
probably provide the most consistent dataset and might even be useful 
for the quantification of fluorescence from phytoplankton (Behrenfeid 
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Fig. 5. First three leading singular vectors (SV) of the OCO-2 TVAC test dataset for the two chlorophyll fluorescence retrievals windows covering the Fraunhofer lines at 758.6 and 758.8 nm 
(left panel) and 770.1 nm (right panel). The explained variance of each singular value is provided within the respective panels. Only 3 out of the 8 OCO-2 footprints are shown for clarity. 
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& Milligan, 2013). The target mode dataset will enable us to study the bi- 
directionality of the fluorescence (Van derTol, Verhoef, & Rosema, 2009; 
Van der Tol, Verhoef, Timmermans, Verhoef, & Su, 2009). Glint observa- 
tions over land will likely see somewhat lower overall fluorescence 
values compared to Nadir as the glint spot may partially look at shaded 
leaves, being opposite to the vegetation hot-spot (Van der Tol, Verhoef, 
Timmermans, et al„ 2009), which is at the same viewing angle but at 
0° azimuth relative to the incident sunlight. 

These features specific for the OCO-2 have to be kept in mind when 
thinking about vegetation-specific remote sensing questions. The OCO- 
2 is not a full mapper like MODIS or GOME-2 because there will be large 
gaps between orbit paths but sampling along track will be continuous 
and provide an unprecedented dataset with high accuracy and precision 
covering very small footprints. We will be able to apply full spectral 
fitting routines and thus also provide reliable error estimates for 
retrieved fluorescence data. Along the orbit track, we will also be able 
to investigate small-scale gradients (on the order of 5-10 km) as a 
total of about 35 OCO-2 measurements will be recorded while the 
satellite Nadir point advances by 10 km, reducing the standard error 
substantially by a factor 1 / s/n, with n being the number of measure- 
ments constituting the ensemble average. This way, fine spatial gradi- 
ents can be properly characterized even within a single orbit path. 
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Fig. 7. Histogram of retrieved chlorophyll fluorescence averaged over both fit windows 
and all 8 footprints. Expected as well as observed standard deviations in retrieved SIF 755 
are denoted as a exp and <j oils , respectively. The dashed red line represents the expected 
Gaussian distribution of retrieved S1F based on estimated OCO-2 radiometric performance 
and Gaussian error propagation in the average. 


H Footprint 3 ( 755 nm fit window) 



b Footprint 3 (771 nm fit window) 



Fig. 6. Histogram of retrieved chlorophyll fluorescence (SIF 755 ) from the 758 nm (top) and 
770 nm (bottom) fit window in one of the OCO-2 footprints. Expected as well as observed 
standard deviations in retrieved SIF755 are denoted as a exp and a obs , respectively. The 
dashed red line represents the expected Gaussian distribution of retrieved S1F based on es- 
timated OCO-2 radiometric performance. 


3. Expected OCO-2 fluorescence retrieval performance 

Retrieval characteristics were evaluated using approximately 80,000 
real atmospheric spectra measured during the OCO-2 Thermal VACuum 
(TVAC) tests atJPL on April 20, 2012. Data were acquired in direct-sun 
mode, thus no actual chlorophyll fluorescence is apparent in the spectra. 
However, the inversion procedure for chlorophyll fluorescence can still 
be performed in order to evaluate expected and observed error charac- 
teristics for single soundings as well as accuracy (i.e. checking whether 
we indeed retrieve zero fluorescence) based on averages over longer 
time-periods. Here, we use the Singular Value Decomposition (SVD) 
fluorescence fitting approach (Guanter et al., 2012) as it enables very 
fast retrievals and also informs us about detector-specific characteris- 
tics. As explained in detail by Guanter et al. (2012, 2013), the SVD fitting 
approach is based on determining singular vectors explaining most of 
the signal variance in fluorescence-free scenes. Subsequent fitting is 
performed by linear least squares, retrieving the best linear combina- 
tion of the singular vectors and a fluorescence-related contribution. 
We developed a prototype of this methodology for the OCO-2 data 
and performed the SVD training for all 8 OCO-2 footprints separately 
as each of them may behave slightly differently ( in terms of wavelength 
calibration, instrument line-shape, dispersion, noise or absolute radio- 
metric calibration). 



Fig. 8. Single measurement precision error of OCO-2 SIF 755 retrievals in a single footprint in 
the 758 nm (black) and 770 nm (red) spectral windows as a function of continuum radi- 
ance level. 
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Fig. 9. Smoothed time-series of S1F retrievals using the TV AC data from April 20, 2012, 
recorded at JPL. Time is in UTC, straight lines are measurements breaks during which 
the heliostat was re-aligned. 


Fig. 5 shows the leading singular vectors as determined from the 
OCO-2 Thermal VACuum (TV AC) dataset under a large range of viewing 
angles. The first singular vector typically represents the main spectral 
shape and a simple multiplicative factor of this singular vector explains 
most of the signal variance. In the ideal case, the following singular 
vectors represent derivatives of the observed radiance with respect to 
changes in key physical quantities such as Doppler shift, oxygen column 
amount or viewing geometry. As apparent in Fig. 5, the short wave- 
length window, which only encompasses the Fraunhofer lines (devoid 
of oxygen absorption lines) exhibits singular vectors (SV2 and SV3) 
mostly representing a spectral shift due to the Doppler shifts. Spectral 
shifts have a non-linear impact on the observed radiance and the singu- 
lar vectors most likely represent a linear approximation of this effect 
using two terms. The 770 nm window includes two stronger oxygen 
features at 769.8 and 769.9 nm and the varying optical depths during 
TVAC (due to varying solar zenith angles) are represented in the second 
singular vector while the third covers most of the spectral shift effect. 
The advantage of the SVD approach is that these derivatives are directly 
computed from the measurements themselves without the need for 


computationally expensive forward model simulations of the true phys- 
ics of the process. The non-linear effect of the spectral shift is linearized 
with the SVD, enabling a simple linear least squares approach for the re- 
trieval. In addition, subtle uncharacterized instrumental features are 
taken into account at the same time. It is expected that we will perform 
both the SVD (Guanter et al., 2012) and physical retrievals with spectral 
fitting fluorescence (Frankenberg, Butz, et al., 2011) for the OCO-2 
as computational demand even for the full spectral fitting routine is 
almost negligible compared to the demand of the full-physics retrieval 
of the atmospheric C0 2 column. The physics-based spectral fitting rou- 
tine for SIF is based on a simple forward model for the fluorescence 
emissions and detailed modeling of the instrument response. For 
OCO-2, each footprint will be characterized with its own dispersion co- 
efficients, instrument line-shape, and noise characteristics. Apart from 
instrument-specific modifications, the routines are identical to those 
used for current GOSAT retrievals; see Frankenberg, Butz, et al. (2011 ) 
for details. The physics-based spectral fitting routine has the advantage 
of not being sensitive to the training dataset of the SVD and is also inter- 
pretable in terms of its physics-based modeling approach. It may, how- 
ever, be more susceptible to instrumental features, warranting initial 
comparisons between the SVD and physics-based approaches once 
OCO-2 is in orbit. 

To assess predicted and true retrieval precision and accuracy, we 
performed SVD-based fits of chlorophyll fluorescence for the TVAC 
dataset, where we know that the truth is zero. For both the long and 
short wavelength fitting window, we define retrieved SIF at 755 nm, as- 
suming a typical spectral shape of SIF. In our case, we use a fixed spectral 
shape where SIF at 770 nm is a factor 1.76 lower than at 755 nm (only a 
factor 1.1 lower at 758 nm), consistent with model estimates (Van der 
Tol, Verhoef, & Rosema, 2009) and also GOSAT observations. We follow 
the retrieval approach by Guanter et al. (2012) and use the first four 
leading singular vectors in the fitting procedure. The spectrally varying 
detector noises are taken into account in the weighted least squares ap- 
proach, also allowing us to directly compute the expected measurement 
uncertainty matrix. 

3.1. Precision estimates 

Fig. 6 shows histograms of retrieved SIF 755 for a representative 
footprint in the 758 and 771 nm fitting windows. The expected dis- 
tribution based on the current OCO-2 detector noise estimates are 
shown as a red dashed line. One can see that expected and observed 


OC02 nadir repeat cycle (16 days), dp<100hPa 



SIF / (W m~~ micron " 1 sr" 1 ) 
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Fig. 10. Typical simulated fluorescence retrieval using a 1 6-day OCO-2 Nadir repeat cycle, same as Fig. 4 but gridded at 1 x 1 ° and using simple cloud filter, where retrieved surface pressure 
is within ±100 hPa compared to the meteorological forecast. 
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standard deviations differ only by a very small amount but agree to 
within 10-15%. In this particular case, single measurement precision 
would be about 0.5 Wm~ 2 sr _1 pm^ 1 in the 758 nm window and 
0.7 Wm -2 sr -1 pm -1 in the 770 nm window. Typical real SIF 755 values 
observed from GOSAT vary between 0 and 2 Wm~ 2 sr _1 pm -1 across 
the globe, meaning that a single measurement from the OCO-2 will in 
most cases be too noisy to be useful. However, given the much smaller 
OCO-2 footprint size, more extreme values in the S1F might be expected 


so that we don't yet fully rule out the potential of using single- 
soundings in case studies. 

To fully exploit the potential of OCO-2 on the global scale, measure- 
ments will have to be averaged to reduce the standard error in the 
mean. Pure un-correlated precision errors in the mean of n samples 
will get reduced by a factor of 1 /•Jh. 

In Fig. 7, we averaged the SIF retrievals of all 8 footprints as well as 
the two different fit windows and show histograms of retrieved SIF as 



Fig. 11. Sampling bias incurred by OCO-2 due to incomplete mapping of 1 x 1° grid boxes. 
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well as expected standard deviations. As expected, the standard error is 
now largely reduced (to about 0.18 Wm“ 2 sr _1 pm^ 1 ) and retrieved 
values still appear veiy Gaussian with an observed standard deviation 
closely matching the expected single measurement precision error. 
These averaged S1F values now exhibit a noise level low enough to be 
used for scientific interpretation on a scale of 1 0.6 x 2 km. Different av- 
eraging schemes can be envisioned but the main conclusion remains: 
the vast amount of OCO-2 spectra will allow the S1F retrievals with 
much higher precision than that of the GOSAT over smaller spatial ex- 
tents, especially along the track direction. 

Given that the S1F retrievals using real atmospheric test data be- 
haved very much as expected, we can use retrieval error simulations 
to estimate single-measurement precisions expected for real OCO-2 
spectra in the orbit. 

Fig. 8 shows the single-measurement precision expected for a 
single OCO-2 footprint as a function of continuum level radiances ex- 
pected in a typical Nadir orbit. Typical errors range from 0.3 to 
0.5 Wm -2 sr -1 pm~'. A few aspects are noteworthy. First, the TVAC 
data have been recorded at a relatively high signal level as direct sun- 
light via a diffusor was observed while the typical range of Nadir signal 
levels is 10-130 Wm -2 sr -1 pm -1 . TVAC precision errors are thus 
somewhat higher than what we expect in the orbit. The higher dynamic 
range in expected OCO-2 radiances is mainly driven by high radiances 
in glint viewing geometries. Second, the single measurement precision 
in absolute SIF improves with decreasing signal level. At first glance this 
may be somewhat counterintuitive as detector signal-to-noise ratios 
(SNR) and 1 -ct errors are anti-correlated. The error in the relative con- 
tribution of SIF to the overall radiance would indeed increase at lower 
signal levels. The continuum level baseline is very well characterized 
with high-resolution spectra as a multitude of spectral points are 
averaged to determine the baseline. The absolute noise within the 
Fraunhofer lines is then proportional to the error in SIF and as the abso- 
lute noise decreases with signal level, SIF retrievals get more precise. 
Overall, single measurement precision will be better than for GOSAT 
as the SNR will rarely drop below 200 and is 200-700 in most cases 
(while darker spectra in GOSAT have SNR< 100). 

The behavior of the single measurement precision with signal level 
may also have important implications for ocean retrievals. In Nadir 
(as observed also over the ocean during the Nadir repeat cycle), back- 
ground radiances are very small in clear sides, so that SIF can contribute 
a substantial fraction to the observed radiance. Over-ocean fluorescence 
originates from photosynthesizing phytoplankton, and is roughly 


proportional to chlorophyll-a concentration near the ocean surface 
(Neville & Gower, 1977). The highest SIF values over the oceans are ex- 
pected along with the coastlines, which could not be well sampled with 
GOSAT’s observation pattern. The continuous swath of OCO-2 will facil- 
itate these studies. Even though we expect SIF absolute values to be in 
general smaller for the ocean (compared to land), we believe that the 
SIF retrievals over the oceans with the OCO-2 might be feasible and 
complement current MOD1S- and MERlS-based fluorescence estimates 
(Xing et al., 2007). Unfortunately, it bears noting that ocean water ab- 
sorbs light about 6 times more strongly near OCO-2 fluorescence wave- 
lengths of « 760 nm compared to the MODIS observation wavelength of 
677 nm. Therefore, while MODIS observations are sensitive to the chlo- 
rophyll fluorescence in roughly the top two meters of the ocean, OCO-2 
will only be sensitive to fluorescence originating in the top half-meter or 
less of the ocean. In addition, the fluorescence peak at 740 nm is much 
lower than the one at 680 nm, which is used for MODIS fluorescence re- 
trievals and ocean Raman scattering may also play a role. Given the low 
computational demand of the SIF algorithm, retrievals will be per- 
formed in any case over all surface types, also over the ocean. However, 
as stated above, we cannot yet confidently assert that OCO-2 SIF re- 
trievals over the oceans will be scientifically useful. 

32. Accuracy estimates 

The question arises as to which level the assumption of un-correlated 
and purely statistical retrieval noise holds and which accuracy levels one 
might expect from the OCO-2. To test this, we used retrievals of the en- 
tire TVAC dataset and computed a smoothed time-series of retrieved SIF. 

Fig. 9 shows the corresponding plot, independently for the two fit 
windows as well as combined. The precision error in these cell averages 
is on the order of 0.01 Wm~ 2 sr~ 1 pm - 1 but significant deviations from 
0 can be observed, up to 0.05 Wm -2 sr -1 pm -1 . There is also a high 
frequency (about 15 min) oscillatory pattern in the retrieved SIF, 
which may be related to a cyclical periodicity in the solar tracking sys- 
tem. In addition, the OCO-2 slit may not have been fully illuminated dur- 
ing the TVAC. While it shows that very subtle instrument features can 
cause systematic biases of up to 0.05 Wm~ 2 sr _1 pm -1 , we are confi- 
dent that this can be seen as an upper bound for in-orbit performance 
where both the instrument as well as slit illumination are better con- 
trolled. However, we will re-assess SIF retrieval accuracy on-orbit 
using statistics over non-fluorescent targets with variable signal levels 
such as deserts and ice or footprints at surface temperatures well 


OC02 nadir repeat cycle (16 days), no filter 



Fig. 12. SIF retrievals of an entire OCO-2 repeat cycle without application of any cloud filter. Visually, it is almost indistinguishable from Fig. 10. 
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a 0C02 nadir repeat cycle (16 days), no filter 



delta-SIF / (W m'~ micron " 1 sr' 1 ) 

- 0.30 - 0.25 - 0.20 - 0.15 - 0.10 - 0.05 0.00 0.05 0.10 0.15 0.20 0.25 0.30 


b 0C02 nadir repeat cycle (16 days), dpdOOhPa filter 



delta-SIF / (W m ' 2 micron " 1 sr' 1 ) 

- 0.30 - 0.25 - 0.20 - 0.15 - 0.10 - 0.05 0.00 0.05 0.10 0.15 0.20 0.25 0.30 


c 0C02 nadir repeat cycle (16 days), dp<50hPa filter 



delta-SIF / (W m ~ 2 micron " 1 sr' 1 ) 


- 0.30 - 0.25 - 0.20 - 0.15 - 0.10 - 0.05 0.00 0.05 0.10 0.15 0.20 0.25 0.30 
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d 0C02 nadir repeat cycle (16 days), dpdOhPa filter 



-0.30 -0.25 -0.20 -0.15 -0.10 -0.05 0.00 0.05 0.10 0.15 0.20 0.25 0.30 


Fig. 13. Absolute biases in retrieved SIF as a function of a surface-pressure retrieval based cloud-filtering scheme (Taylor et al., 2012). The threshold (dp) is based on the absolute difference 
between the a priori surface pressure from meteorological forecast and the retrieved surface pressure using the entire 0 2 -A band. 


below freezing. In summary, we have shown that most of the noise 
in the retrieved SIF can be readily reduced by averaging and follows 
the 1 /y/n law. However, analysis of variations on the order of only 
0.02-0.05 Wm“ 2 sr _1 pm -1 should be handled with care as accuracy 
errors will become significant. Assuming the empirical linear relation 
between SIF and GPP as observed by Frankenberg, Fisher, et al. (2011), 
the above values correspond to GPP values of only 0.13-0.32 g cm 
“ 2 day -1 . 

4. Expected OCO-2 fluorescence datasets on the global scale 

So far, we have only analyzed the expected OCO-2 performance on a 
single-sounding or ensemble averages. In this section, we will cover as- 
pects relevant to carbon cycle modelers on the global scale, i.e. 

• What impact does the footprint size have on SIF retrievals? 

• What errors on the regional scale are introduced by preferential and 
incomplete sampling of OCO-2 only along a narrow orbit track? 

• What datasets can I expect within a typical OCO-2 repeat cycle, how 
many days of averaging do 1 need at what grid-scale? 

• How do clouds and aerosols affect the SIF retrievals given potentially 
imperfect filtering? What is the trade-off between cloud-filtering, ac- 
curacy and data yield? 

4.1. Impact of footprint size 

A major concern of the community working with remotely sensed 
vegetation datasets is the footprint size. This is due to two factors: 
1 ) Local studies are interested in very specific and potentially small- 
scale features (e.g. agricultural plots) and 2) ground pixels should ideal- 
ly be homogenous as sub-pixel heterogeneity can bias the retrieval of 
vegetation indices. For example, a footprint which is half covered with 
snow and half with vegetation won't have the same NDVI as the average 
of the two individual NDVI of snow and vegetation. Due to non- 
linearities, the averages of retrieved quantities based on individual radi- 
ances Ii and the retrieved quantity based on averaged radiances are not 
the same: 

^NDVI(/ i )^NDVl^/ i J. 


Owing to this inequality, footprints have to be homogenous to be un- 
biased for classical optical remote sensing of vegetation indices. SIF, 
however, is different in its retrieval characteristics as it is only sensitive 
to emissions emanating from the surface, with non-fluorescing targets 
(snow, streets, etc.) not contributing to the signal. We essentially mea- 
sure the emitted radiance integrated over the field-of-view and as such, 
the operators for the SIF retrievals are by definition interchangeable: 

[ SIF(/) = SIF ( [ /). 

In the extreme case, we could thus argue that the measurement of 
the (day-lit) hemispherically integrated SIF from the Earth can provide 
a proxy for the global gross primary production (under the assumption 
of a linear relation, as empirically found by Frankenberg, Fisher, et al. 
(2011)). However, spatial context will of course be lost but this extreme 
example illustrates that spatial resolution for fluorescence retrievals is 
irrelevant unless you want to attribute specific patterns to specific bi- 
omes or local processes. We basically measure the spatially integrated 
photon-flux of a by-product of photosynthesis and the spatial resolution 
depends on the interest of the user community and is not a prerequisite 
in itself. However, OCO-2 will push the spatial resolution of SIF mea- 
surements to a new level, allowing more detailed and local researches 
than is currently possible. But we should also keep in mind that most 
global vegetation modelers - not to speak of inverse modelers using at- 
mospheric C0 2 data - don’t work with high resolution imagery but only 
with down-scaled vegetation index products. 

4.2. Impact of preferential sampling along the narrow OCO-2 swath, what 
coverage and data to expect 

To answer these questions, we analyzed a full repeat cycle (16 days) 
of simulated OCO-2 data for September. We used the OCO-2 orbit sim- 
ulator (see Appendix A), which now includes a fluorescence signal 
that is geographically distributed by linearly scaling 0.5° resolution 
GPP maps from Beer et al. (2010). The simulated data include realistic 
cloud and aerosols fields based on CALIPSO statistics (Winker, Hunt, & 
McGill, 2007) and also include full multiple scattering radiative forward 
modeling of the fluorescence emission emanating from the surface. De- 
tails of the simulations can be found in the Appendix A. As can be seen in 
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Fig. 4, the OCO-2 only samples a very narrow swath with large gaps in- 
between orbit paths. 

In Fig. 1 0, we use exactly the same data but grid them at 1 x 1 °. This 
results in a quasi-global coverage albeit with sampling errors associated 
with each of the larger grid cells. To quantify the sampling bias in this 
potential high-level product and its relevance for regional scale carbon 
cycle modeling, we analyzed high-resolution MODIS GPP data (0.05°) 
and its dependence on preferential sampling at the 1x1° grid. The 
top panel of Fig. 11 shows the high-resolution MODIS GPP dataset re- 
gridded at 1 x 1° based on the full dataset. The middle panel shows 
the grid cell averages based on samples just along with the direct 
Nadir point of the OCO-2 instrument, i.e. with a very limited though sto- 
chastic sampling. The difference between the two is displayed in the 
lower panel, showing that overall only relatively small and spatially un- 
correlated errors exist. The largest differences are observed along coast- 
lines, where grid cells cross the land/ocean boundary and we averaged 
OCO-2-based data only over land pixels while the high resolution 
MODIS GPP was averaged over all pixels. Overall though errors are 
small in most places, showing that despite the narrow swath, OCO-2 
data aggregated to larger scales (e.g. 1 x 1°) are representative of true 
grid cell averages. Even the small errors can be avoided if OCO-2 fluores- 
cence data is gridded at finer spatial resolution, albeit at the expense of 
apparent coverage (gaps in the maps). For visualization, gap-free global 
maps will be preferred but detailed analysis can take the spatial sam- 
pling fully into account. 

It is worth mentioning that OCO-2 will collect as many spectra with- 
in one repeat cycle as the GOSAT does in slightly more than 4 years. This 
was the main barrier of using GOSAT fluorescence retrievals at shorter 
temporal and spatial scales because high single-measurement noise in 
conjunction with low data amount resulted in high standard errors 
and the subsequent need for large-scale averaging to suppress the ap- 
parent noise. The effect of noise is already included in the OCO-2 simu- 
lated retrievals in Fig. 1 0, showing that the OCO-2 will be able to provide 
smooth global distributions of SIF bi-weekly. 


4.3. Impact of atmospheric scattering on SIF retrievals: curse or blessing? 

We have shown in Frankenberg, O'Dell, Guanter, and McDuffie 
(2012) that SIF retrievals (if based on Fraunhofer lines) are very insen- 
sitive to atmospheric scattering, with more than 80% of the emanating 
fluorescence at the canopy scale reaching the detector in space even 
up to optical densities of 4 (as can be seen in Fig. 12, which didn't in- 
clude any cloud filtering at all). Unlike reflectance-based products, the 
SIF retrievals are mostly subdued by atmospheric scattering, mostly 
when optical densities are very high and/or single scattering albedo 
low. In principle, these highly scattering scenes can be easily filtered 
out, especially because the operational OCO-2 preprocessing filters 
(O'Dell et al„ 2012) for atmospheric C0 2 retrievals are very strict and 
only pass very clear scenes (« <0.3 OD). This could minimize a potential 
systematic low-bias in the SIF. The question is, is this desirable and does 
it reflect the true variability in GPP? Aren't we trading accuracy in the 
SIF with a clear-sky bias in our observations? As absorbed photosyn- 
thetic active radiation (APAR) is reduced in highly scattering scenes, 
the true SIF (and GPP) will also be subdued. If we can quantify the signal 
loss in the SIF from the canopy to the top-of-atmosphere (TOA), we 
could thus truly capture changes even under cloudy conditions and 
thereby minimize a clear-sky bias in SIF observations. Further studies, 
especially in regard to changes in fluorescence and photosynthesis 
yield under changing illuminations, will be needed to quantify this ef- 
fect. This will pave the way towards the quantification of photosynthe- 
sis and its efficiency in atmospheric scattering conditions, potentially 
even looking at GPP enhancements in scattered light conditions, 
which has a large control on GPP (Gu et al„ 2002, 2003; Kanniah, 
Beringer, North, & Hutley, 2012). Flowever, we acknowledge that 
more studies are needed towards that long-term goal. 


Fig. 13 shows the bias in the retrieved SIF depending on the choice of 
a cloud-filter threshold. The bias is defined as the difference between re- 
trieved top-of-atmosphere and top-of-canopy SIF. The cloud-filtering 
scheme is part of the operational OCO-2 pre-processing pipeline. The ef- 
fect is in general largest in the tropics with the most persistent cloud- 
cover. One can see that a relatively relaxed threshold of 1 00 hPa already 
largely reduces the biases compared to the no-filtering case while strict 
cloud-filtering ( 10 hPa threshold) minimizes biases but leads to a more 
noisy map as the total number of measurements used is reduced. We 
propose that a filter threshold of 50-100 hPa should be sufficient for 
the cloud-filtering of SIF retrievals. This is much more relaxed than for 
OCO-2 XC0 2 retrievals, where data yields of only 7% are expected. The 
yields based on simulator data for the 100 hPa, 50 hPa, and 10 hPa 
thresholds are about 50%, 36%, and 14%, respectively. 

5. Vegetation indices from OCO-2 

One of the standard retrieval products from the OCO-2 are surface 
albedos and slopes within the 3 bands at 0.76, 1.6, and 2.0 pm. Owing 
to the high spectral resolution, these albedos are unperturbed by gas- 
eous absorption and also implicitly corrected for atmospheric scattering 
by the full-physics 3-band OCO-2 retrieval algorithm (Bosch et al., 2006; 
O'Dell et al., 2012). In addition, cloud and aerosol filtering based on 
the strongly absorbing oxygen band as well as the C0 2 ratio approach 
is very efficient and robust. 

Vegetation water content indices (Fensholt & Sandholt, 2003; Gao, 
1996; Hunt & Rock, 1989) are a good example of the extra informa- 
tion that we can obtain from the OCO-2 atmospherically corrected sur- 
face reflectance data, since the shortwave infrared (SWIR) domain 
(1.4-3 pm) is strongly affected by liquid water absorption, whereas 
the near infrared (NIR) region (0.75-1.4 pm) is relatively insensitive, a 
combination of both bands can be used to assess vegetation liquid 
water content. Following the formulation of the Normalized Difference 
Infrared Index (NDII) described by Fensholt and Sandholt (2003), we 
can write 

pjDI] _ ^NIR ^SWIR 
^NIR + ^SWIR 

where R refers to surface reflectance in the NIR and SWIR regions 
(MODIS channels at 0.86 and 1.64 pm by Fensholt and Sandholt 
(2003), and 0.758 or 0.770 pm and 1.6 pm for OCO-2). Fig. 14 shows a 
temporal series from 06/2009 to 01/2011 of the retrieved NDII derived 



Fig. 14. NDII calculated with GOSAT-FTS data and NDW1 from MODIS 06/2009-01/2011 
for the Amazon region (lat: —10° to 2.5°, Ion: —75° to —55°). The fitted smooth curves 
are derived by using the smoothing spline method. The extreme drought in mid 2010 is 
clearly visible in both indices. 
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from GOSAT-FTS data equivalent to OCO-2 measurements (NIR and 
SW1R at 0.758 and 1.55 pm, respectively) in the Amazon region (lat: 
— 10° to 2.5°, Ion: — 75° to — 55°) together with the MODIS Normalized 
Difference Water Index (NDWI) (Gao, 1996) derived from MODIS re- 
flectance data (NIR and SWIR at 860 nm and 1240 nm, respectively). 
It can be observed that the NDII derived from the GOSAT-FTS shows a 
good agreement with the NDWI from the MODIS, which has been wide- 
ly used in vegetation remote sensing applications (Fensholt & Sandholt, 
2003; Gao, 1 996; Hunt & Rock, 1 989), in particular for the assessment of 
vegetation water stress. In this example, both indices are capturing the 
extreme 2010 drought event in the Western Amazon Basin (Lewis, 
Brando, Phillips, van der Heijden, & Nepstad, 2011). 

There are two other features of OCO-2 data of potential interest for 
vegetation remote sensing: one is the capability to provide very accu- 
rate reflectance and reflectance slope measurements at the longest 
spectral range of the vegetation red-edge window (757 nm), which 
can help to disentangle leaf biochemical constituents from canopy 
structure (Knyazikhin et al., 2012). The other is the smaller sensitivity 
of fluorescence to cloud and aerosol contamination than that of the 
reflectance-based data, which can be a critical improvement in tropical 
rainforest areas where vegetation indices face the greatest difficulties to 
provide reliable signals due to atmospheric effects (Hilker et al., 2012). 

6. Conclusions 

We have used the Thermal VACuum (TV AC) data from the actual 
OCO-2 instrument as well as orbit simulations for a full Nadir repeat 
cycle of 16 days to study the prospects of OCO-2 for remote sensing 
of chlorophyll fluorescence. We found that the instrument per- 
forms very well and will provide solar-induced chlorophyll 
fluorescence at 755 nm (SIF 755 ) with a single-measurement preci- 
sion of about 0.3-0.5 Wm“ 2 sr _1 pm -1 and accuracy of better than 
0.05 Wm _2 sr _1 pm -1 . While these numbers are already better 
than current retrievals from the GOSAT satellite, the biggest advance- 
ment will be achieved by the dramatic increase in measurement 
frequency (100 fold compared to GOSAT) and much smaller 
ground-pixel sizes (1.3 x 2.25 km 2 ). The OCO-2 will thus provide ac- 
curate near-global data for each repeat cycle (as opposed to a coarse 
seasonal average for GOSAT). The sun-synchronous orbit with 
1 :30 p.m. local overpass time will enable mid day SIF measurements 
similar to the GOSAT but with higher fidelity and much reduced stan- 
dard error. The very narrow swath of OCO-2 won't allow for truly 
global retrievals as does GOME-2 (Joiner et al., 2013) but we found 
that this sampling bias will be small when averaged over regional 
(e.g. 1x1°) scales, thus only hampering local studies either not cov- 
ered by the OCO-2 swaths or with gradients of interest not properly 
aligned with the flight track. 

We also studied the impact of scattering on the SIF retrievals on the 
global scale and found that cloud and aerosol filtering for the SIF re- 
trievals can be much more relaxed compared to the strict filtering nec- 
essary for the X(-q 2 retrievals. In fact, even cloudy pixels may be of 
interest for carbon cycle studies as relaxed cloud filters minimize a po- 
tential clear-sky bias in the SIF retrievals. Further studies are needed 
to quantify this trade-off between retrieval and sampling bias. 

Eventually, consolidation of biophysics based fluorescence to GPP 
modeling (e.g. Van der Tol, Verhoef, & Rosema, 2009; Van der Tol, 
Verhoef, Timmermans, et al., 2009) will be needed to fully exploit the 
potential of space-based SIF observations. We hope that OCO-2 will fa- 
cilitate this as SIF validation along the narrow orbit and small footprints 
will actually become feasible for the first time. 
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Appendix A 

Details of the OCO-2 orbit simulator 

The OCO-2 orbit simulator is a modular code, developed at Colorado 
State University (CSU), to provide realistic modeling of reflected short- 
wave radiances as might be measured via satellite sensors at top of 
atmosphere (TOA). Descriptions of the mechanics pertaining to orbit 
paths, viewing geometries and other details that are not particularly rel- 
evant to the current research can be found in O'Brien, Polonsky, O'Dell, 
and Carheden (2009). 

In short, the simulator can be set up to emulate a polar-orbiting 
satellite, including Nadir, Glint, and special Target viewing geometries 
with sampling frequency and FOV the same as that expected from the 
OCO-2. The successive order of interaction (SOI) radiative transfer solv- 
er (Heidinger, O'Dell, Bennartz, & Greenwald, 2006; O'Dell, Heidinger, 
Greenwald, Bauer, & Bennartz, 2006), coupled with the low-stream 
interpolator (LSI) (O'Dell, 201 0) produces scalar radiances at the appro- 
priate spectral bands and with the desired spectral resolution. Polarized 
radiances are determined via a second-order of scattering approxima- 
tion (Natraj & Spurr, 2007). An instrument model is provided based 
on best guess and TVAC testing of the OCO-2 instrument and surface in- 
formation is provided by the MODIS. For the current simulations, mete- 
orological information was provided from the ECMWF reanalysis. 

Of particular relevance to the current research is the modeling of 
clouds and aerosols in the simulations. Due to the rather cumbersome 
nature of the OCO-2 full physics retrieval algorithm, only approximately 
6-7% of the total number of measured soundings will be inverted to 
yield Xqq 2 . Scenes heavily contaminated by cloud will be easily re- 
moved from processing via cloud screening, as described below. There- 
fore, simulations were performed using statistical sampling of a 
database containing optical thicknesses less than approximately 5. 

The cloud and aerosol database in the current work is based on high 
vertical resolution profiles of optical thickness as measured by the 
Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP) sensor 
aboard the Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Obser- 
vation (CALIPSO) satellite in the A- train (Winker et al., 2007). This is the 
same orbit into which the OCO-2 is planned to be launched. For each 
time and location as the simulator stepped through an orbit, a single 
vertical CALIPSO profile was selected at random from a monthly data- 
base of 2° by 2° latitude/longitude boxes. To best represent the selected 
vertical CALIPSO profile, appropriate cloud and aerosol optical proper- 
ties files, needed in the Mie radiative transfer calculations, were taken 
from a static internal database. 

Cloud screening on the simulated data set was performed using both 
the Oxygen A-band retrieval (AB02 for short) that was detailed by 
(Taylor et al., 2012) and the IMAP-DOAS retrieval described by 
(Frankenberg, Platt, & Wagner, 2005). The AB02 relies primarily on 
the difference between the retrieved surface pressure (p s ) and the esti- 
mated value from the ECMWF forecast. Truncation (lengthening) of 
the optical path due to reflection (multiple scattering) from clouds 
manifests as underestimates (overestimates) of p s . Previous work has 
demonstrated that the AB02 cloud screen alone can remove scenes con- 
taminated with optical thicknesses greater than approximately 0.3 OD, 
except for those with clouds at low altitudes, where path length modi- 
fication is too small relative to the total path (O'Dell et al., 2012). 

The IMAP-DOAS routine utilizes ratios in retrieved C0 2 between the 
weak and strong absorption bands, and similar ratios of H 2 0 to distin- 
guish clear from cloudy scenes. Unpublished work has shown that the 
IMAP-DOAS routine is efficient at removing low, thick clouds, thereby 
making the combination of the two cloud screeners very powerful. 
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The current OCO-2 operations call for these two algorithms to be run on 
the entire OCO-2 data set 1 million soundings per day), the results of 
which will be passed to a scene selection algorithm to determine which 
scenes will be processed by the computationally expensive full physics 
retrieval code. The SIF retrievals, however, will be part of the pre- 
processor and as such not be subject to pre-filtering. 
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